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Abstract 

This paper considers the problem of adaptive estimation of a non-homogeneous intensity 
function from the observation of n independent Poisson processes having a common intensity 
that is randomly shifted for each observed trajectory. We show that estimating this intensity 
is a deconvolution problem for which the density of the random shifts plays the role of the 
convolution operator. In an asymptotic setting where the number n of observed trajectories 
tends to infinity, we derive upper and lower bounds for the minimax quadratic risk over 
Besov balls. Non-linear thresholding in a Meyer wavelet basis is used to derive an adaptive 
estimator of the intensity. The proposed estimator is shown to achieve a near-minimax rate 
of convergence. This rate depends both on the smoothness of the intensity function and 
the density of the random shifts, which makes a connection between the classical deconvo- 
lution problem in nonparametric statistics and the estimation of a mean intensity from the 
observations of independent Poisson processes. 
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1 Introduction 



Poisson processes became intensively studied in the statistical theory during the last decades. 
Such processes are well suited to model a large amount of phenomena. In particular, they are 
used in various applied fields including genomics, biology and imaging. In this paper, we consider 
the problem of estimating nonparametrically a mean pattern intensity A from the observation of 
n independent and non-homogeneous Poisson processes A^^, . . . , A" on the interval [0, 1]. This 
problem arises when data (counts) are collected independently from n individuals according 
to similar Poisson processes. In many applications, such data can be modeled as independent 
Poisson processes whose non-homogeneous intensities have a common shape. A simple model, 
that is well studied for genomics applications [25], is to assume that the intensity functions 
Ai, . . . , A„, of the Poisson processes A^, . . . , are randomly shifted versions Aj(-) = A(- — Tj) 
of a common intensity A, where ti, . . . ,t„ are i.i.d. random variables. The intensity A that 
we want to estimate is thus the same for all the observed processes up to random translations. 
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Basically, such a model corresponds to the assumption that the recording of counts does not 
start at the same time (or location) from one individual to another, e.g. when reading DNA 
sequences from different subjects in genomics |22j . 

In more rigorous terms, let ri, . . . , r„ be i.i.d. random variables with known density g with 
respect to the Lebesgue measure on M. Let A : [0, 1] — )• M+ a real-valued function. Throughout 
the paper, it is assumed that A can be extended outside [0, 1] by periodization i.e. by taking 
X(t) = X{t mod 1) for t ^ [0, 1], where t mod 1 denotes the modulo operation. We suppose that, 
conditionally to ri, . . . ,t„, the point processes A^^, . . . are independent Poisson processes 
on the measure space ([0, 1], ;S([0, 1]), di) with intensities Xi{t) = A(t — Tj) for t £ [0,1], where dt 
is the Lebesgue measure. Hence, conditionally to Tj, iV* is a random countable set of points in 
[0, 1], and we denote by dN} = dN^{t) the discrete random measure XlreAf* ^T(t) for t £ [0, 1], 
where 6t is the Dirac measure at point T. In other terms, conditionally to ti, . . . , r„, one has 
that for any set A £ B{[0, 1]) and for each 1 < i < n, the number of points of lying in A 
is a random variable (A) = dNl = dN'^ (t) which is Poisson distributed with parameter 

X{t — Ti)dt. Moreover, for all finite family of disjoint measurable sets Ai, . . . , Ap of B{[0, 1]), 
the random variables N^{Ai), . . . , N^{Ap), i = 1 . . . , n are independent. For an introduction to 
non- homogeneous Poisson processes we refer to [13]. The objective of this paper is to study the 
estimation of A from a minimax point of view as the number n of observed Poisson processes 
tends to infinity. 

Denote by ||A||2 = Jq \X{t)\'^dt the squared norm of a function A belonging to the space 
L^([0, 1]) of squared integrable functions on [0,1] with respect to dt. Let A C L^([0, 1]) be 
some smoothness class of functions, and let A„ G -^^^([0, 1]) denote an estimator of the intensity 
function A G A, i.e a measurable mapping of the random processes A*, i = 1, . . . ,n taking its 
value in L^([0, 1]). Define the quadratic risk of the estimator A„ as 

7^(A„,A) =E||A„-A||2, 

and introduce the following minimax risk 

7^„(A) = ipf sup7^(A„, A), 

An agA 

where the above infimum is taken over the set of all possible estimators constructed from 
A^, . . . , A". In order to investigate the optimality of an estimator, the main contribution of 
this paper is to derive upper and lower bounds for 7^„(A) when A is a Besov ball, and the 
construction of an adaptive estimator that achieves a near-minimax rate of convergence. 

The estimation of the intensity of non-homogeneous Poisson process has recently attracted 
a lot of attention in nonparametric statistics. In particular the problem of estimating a Poisson 
intensity from a single trajectory has been studied using model selection techniques [TO] and 
non-linear wavelet thresholding [7], [14j . |20j . |23j . Adopting an inverse problem point of view, 
estimating the intensity function of an indirectly observed non-homogeneous Poisson process 
has been considered by [1], [6], [17]. Poisson noise removal has also been considered by [8], [23] 
for image processing applications. Deriving optimal estimators of a Poisson intensity using a 
minimax point of view has been considered in [6] , [19] , [20] |23j , in the setting where the intensity 
of the observed process X{t) = KXo(t) such that the function to estimate is the scaled intensity 
Ao and k is a positive real, representing an "observation time", that is let going to infinity to 
study asymptotic properties. 

In this paper, since we observe n independent Poisson processes, we adopt a different asymp- 
totic setting where n tends to infinity. In this framework, our main result is that estimating A 
corresponds to a deconvolution problem where the density g of the random shifts ti, . . . ,Tn is 
a convolution operator that has to be inverted. Hence, estimating A falls into the category of 
Poisson inverse problems. A related model of randomly shifted curves observed with Gaussian 
noise has been considered by [2] and [3]. The results in [2] show that estimating a mean shape 
curve in such models is a deconvolution problem. However, to the best of our knowledge, the case 
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of estimating a mean intensity from randomly shifted trajectories in the case of a Poisson noise 
has not been considered before. The presence of the random shifts significantly complicates the 
construction of upper and lower bounds for the minimax risk. In particular, to derive a lower 
bound, standard methods such as Assouad's cube technique that is widely used for standard 
deconvolution problems in a white noise model (see e.g. |18j and references therein) have to be 
carefully adapted to take into account the effect of the random shifts. 

The rest of the paper is organized as follows. In Section [21 we describe an inverse problem 
formulation for the estimation of A, and a linear but nonadaptive estimator of the intensity is 
proposed. Section [3] is devoted to adaptive estimation using non-linear Meyer wavelet thresh- 
olding, and to the construction of an upper bound on the minimax risk over Besov balls. In 
Section H] a lower bound on the minimax risk is derived. 



2 Linear estimation 

2.1 Inverse problem formulation 

For each observed counting process, the presence of a random shift complicates the estimation 
of the intensity A. Indeed, for alH G {1, . . . , n} and any / G L'^i[0, 1]) we have 



E 



[' f{t)dNi\T] = [' f{t)X{t-T,)dt, (2.1) 
Jo J Jo 



where E[.|rj] denotes the conditionnal expectation with respect to the variable Tj. Thus 
E C f{t)dNl= ['fit) [ \{t - T)g{T)dTdt = [' f{t){X*g){t)dt. 



Hence, the mean intensity of each randomly shifted process is the convolution X * g between 
A and the density of the shifts g. This shows that a parallel can be made with the classical 
statistical deconvolution problem which is known to be an inverse problem. This parallel is 
highlighted by taking a Fourier transformation of the data. Let {ei)i£z the complex Fourier 
basis on [0, 1], i.e. ei{t) = e^^'^" for alH G Z and t G [0, 1]. For £ G Z, define 

1 rl 

\{t)eii{t)dt and "fi = I g{t)ei{t)dt, 
Jo 

as the Fourier coefficients of the intensity A and the density g of the shifts. Then, for ^ G Z, 
define ye as 



^ n ^1 

ye 



-Y, I e,{t)dNl. (2.2) 
Using (|2.ip with / = e^, we obtain that 

n ..I n 

where we have used the notation 

1 " 

_^gi2^£T,^ V£gZ. (2.3) 



n 

i - 

7£ 



n 

i=l 



Hence, the estimation of the intensity A can be formulated as follows: we want to estimate the 
sequence {6i)i£z of Fourier coefficients of A from the sequence space model 

ye = leOe + ^i,n, (2.4) 
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where the are centered random variables defined as 



Ce,n = - V / ee{t)dNi - / ee{t)X{t - Ti)dt 
™ Uo Jo 



for ah £eZ. 



The model ()2.4p is very close to the standard formulation of statistical linear inverse problems. 
Indeed, using the singular value decomposition of the considered operator, the standard sequence 
space model of an ill-posed statistical inverse problem is (see [5] and the references therein) 

Q = One + ze, (2.5) 

where the 7^'s are eigenvalues of a known linear operator, and the zis represent an additive 
random noise. The issue in model ()2.5p is to recover the coefficients 9i from the observations q. 
A large class of estimators in model (j2.5p can be written as 

where 5 = {5e)e£z is a sequence of reals with values in [0, 1] called filter (see [5] for further 
details). 

Equation ()2.4p can be viewed as a linear inverse problem with a Poisson noise for which the 
operator to invert is stochastic with eigenvalues 7^ (j2.3p that are unobserved random variables. 
Nevertheless, since the density g of the shifts is assumed to be known and given that 

Eji = 'ji 

and 7£ w 7^ for n sufficiently large (in a sense which will be made precise later on), an estimation 
of the Fourier coefficients of / can be obtained by a deconvolution step of the form 

9e = sA, (2.6) 

where 6 = is a filter whose choice has to be discussed. 

In this paper, the following type of assumption on g is considered: 



Assumption 2.1 The Fourier coefficients of g have a polynomial decay i.e. for some real v > Q, 
there exist two constants C > C" > such that C'\l\~'' < \'ye\ < C\l\-'' for all leZ. 

In standard inverse problems such as deconvolution, the expected optimal rate of convergence 
from an arbitrary estimator typically depends on such smoothness assumptions for g. The 
parameter u is usually referred to as the degree of ill-posedness of the inverse problem, and it 
quantifies the difficult of inverting the convolution operator. We will also need the following 
technical assumption on the decay of the density g, which is not a very restrictive condition as 
g is supposed to be an integrable function on M. 

Assumption 2.2 There exists a constant C > and a real a > 1 such that the density g 
satisfies g{x) < ij^^^a for all 2; G M. 

2.2 A linear estimator by spectral cut-off 

First, we propose a non-adaptive estimator in order to derive an upper bound on the minimax 
risk. This part allows us to shed light on the connexion between our model and a deconvolution 
problem. For a given ffiter {5i)i£z and using (j2.6p . a linear estimator of A is given by 

A5(i) = X^^^e^t) = J^<^f77^y^eKi), * e [0,1], (2.7) 
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whose quadratic risk can be written in the Fourier domain as 



1&. 



The following proposition illustrates how the quality of the estimator (in term of quadratic 
risk) is related to the choice of the filter b. 

Proposition 2.1 For any given non-random filter 5, the risk of can be decomposed as 

7^(A^ X) = Y^ \9,\\6e -if + Yl -|7r'l|A||i + -I^^l' d^^l"' - 1) • (2.8) 



where ||A||i = X{t)dt. 
Proof. Remark that 



9e — Bi 



7f 



(2.9) 



i=l 



where the e^^j are centered random variables defined as ee^i = 7^ 

e^W {dNl - X{t - Ti)dt) 

Now, to compute E|0£ — 9(\^, remark first that 
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n ^-^ 



1=1 



If. 



+ 23f?e 



,11 



11 



n 



1=1 



i=l / i,i'=l 



Taking expectation in the above expression yields 



I T" 1 , . . . , T„ 



E 



+E 











' r- n 




6,^-1 


/ 

+ 25Re Oi 


di 1 

7e 


E 


n ^-^ 

i=l 


j |ti, . . . , T„ 



^2 n 

Y E[ef,iQ-7|Ti,...,Tn] 

iA'=l 



Now, remark that given two integers i ^ i' and the two shifts Ti,Ti', ei^i and e^^j' are independent 
with zero mean. Therefore, using the equality 



E 



3 



1 



6j\je\-^E\je - + {6e - 1)^ = (J, - 1)^ + - 1), 

n 



one finally obtains 
E\9e- 



'E 





2 


6,^-1 


+ E 



















52 " 

4^E[|e^,,|2|ri,...,T„] 



n 



i=l 



\lf\ 



1) +E|e^,i|2) . 
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Using in what follows the equality E\a+ib\'^ = E[|ap+|6|2] with a = /J cos(27r^t) {dN^ - X{t - Ti)dt) 
and h = Q sin(27r£t) (diV/ - \{t - Ti)dt) , we obtain 



[ ee{t) [dNl - \{t - Ti)dt) 
Jo 



2 



jel E / (I cos(27r£t)p + | sin(27r£t)p) X{t - Ti)dt = |7£r^||A||i, 







where the last equality follows from the fact that A has been extended outside [0, 1] by peri- 
odization, which completes the proof. □ 



Note that the quadratic risk of any linear estimator in model (|2.4p is composed of three 
terms. The two first terms in the risk decomposition (|2.8p correspond to the classical bias and 
variance in statistical inverse problems. The third term corresponds to the error related to the 
fact that the inversion of the operator is done using (7/)/gz instead of the (unobserved) random 
eigenvalues (7/)iez- 

2.3 Upper bound of the minimax risk on Sobolev balls 

There exist different type of filters in the inverse problems literature (see e.g. [5]). In this section, 
we consider the family of projection (or spectral cut-off) filters 5^ = {Se)^^^ = (ll||£|<7V//})^g2 
for some M G N. Using Proposition 12. H it follows that 



nCx''\x) = Y.m' + l Yl ih^niMii + mmir'-i)). (2.10) 

i>M \e\<M 

Now, consider the following smoothness class of functions (a Sobolev ball of radius A) 

Hs{A) = J A G L^([0,1]) ■,^{1 + \£\^')\9e\^ < A and A(t) > for all t G [0, 1] I , 
I eez J 

and some smoothness parameter s > 0, where Oi = e~'^^^'"^ X{t)dt. For an appropriate choice 
of the spectral cut-off parameter M, the following proposition gives the asymptotic behavior of 
the risk of A'' , see equation (|2.7p . 



Proposition 2.2 Assume that f belongs to Hs{A) with s > 1/2 and A > 0, and that g satisfies 

Assumption Ii2.1]) . If M = is chosen as the largest integer such M„ < n^^+^^ , then as 
n — )• +00 

sup 

\&Hs{A) ^ 



The proof follows immediately from the decomposition ()2.10p . the definition of Hs{A) and As- 
sumption (|2.ip . 

Hence, Proposition 12.21 shows that under Assumption 12.11 the quadratic risk 7^(A'^ , A) is 
of polynomial order of the sample size n, and that this rate deteriorates as the degree of ill- 
posedness u increases. Such a behavior is a well known fact for standard deconvolution problems, 
see e.g. [TS], [12] and references therein. Proposition 12.21 shows that a similar phenomenon 
holds for our linear estimator. Hence, there may exist a connection between estimating a mean 
pattern intensity from a set of non-homogeneous Poisson processes and the statistical analysis of 
deconvolution problems. However, the choice of M„ depends on the a priori unknown smoothness 
s of the intensity A. Such a spectral cut-off estimator is thus non-adaptive and is of limited 
interest for applications. Moreover, the result of Proposition 12.21 is only suited for smooth 
functions since Sobolev balls Hs{A) for s > 1/2 are not well adapted to model intensities A which 
may have singularities. In the following section, we thus consider the problem of constructing 
an adaptive estimator and we study its minimax risk over Besov balls. 
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3 Adaptive estimation in Besov spaces 
3.1 Meyer wavelets 

We will use Meyer wavelets to obtain a non-linear and adaptive estimator. Let us denote by ip 
(resp. (p) the periodic mother Meyer wavelet (resp. scaling function) on the interval [0, 1] (see 
e.g. [181 [12] for a precise definition). The intensity A G L^([0, 1]) can then be decomposed as 
follows 

+00 2J-1 

k=0 3=30 k=0 

where 4>jo,k(t) = 2^''(j){2^°t — k), ^j,fc(t) = 2^ip(2H — k), jo > denotes the usual coarse level of 
resolution, and 

Cjo.fc = / ^{t)(pjo,k{t)dt, Pj,k = / X{t)ipj^k{t)dt, 
Jo Jo 

are the scaling and wavelet coefficients of A. It is well known that Besov spaces can be char- 
acterized in terms of wavelet coefficients (see e.g [E]). Let s > denote the usual smoothness 
parameter, then for the Meyer wavelet basis and for a Besov ball 5^ ^(A) of radius ^ > with 
1 < (7 < oo, one has that 



b;ja) = { f G L\[o,i]) 





p 


^ +00 




1 - 


^ 2j{s+l/2~l/p)q 






\^3=30 



k=0 




with the respective above sums replaced by maximum if p = oo or g = oo. The parameter s 
is related to the smoothness of the function /. Note that p = q = 2, then a Besov ball is 
equivalent to a Sobolev ball if s is not an integer. For 1 < p < 2, the space Bpg(A) contains 
functions with local irregularities. 

Meyer wavelets satisfy the fundamental property of being band-limited function in the Fourier 
domain which make them well suited for deconvolution problems. More precisely, each (pj^^ and 
ijjj^k has a compact support in the Fourier domain in the sense that 



with ^ ^ 

Ci{(p3o,k) = [ e~^''^'<P,,,k{t)dt, cKV'i.fc) = / e~^''^'iP,^k{t)dt, 
Jo Jo 

and where Dj^ and are finite subsets of integers such that i^Dj^ < C2^° , H^^j ^ C2^ for 
some constant C > independent of j and 

% C [-2^+2co, -2^cq\ U [2-'co, 2^+2co] (3.1) 



with Co = 27r/3. Then, thanks to Parseval's relation Cj^^k = YlteDj ^c{^jo,k)^ej Pj,k = J2een.j Cf(^j,fc)^f- 
and from the unfiltered estimator §£ = j^^ye of each Oi, see equation ()2.4p . one can build esti- 
mators of the scaling and wavelet coefficients by defining 

^3o,k = E ^d'^3o,k)^e and /3j,fc = ^ ce{ipj,k)d£- (3.2) 
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3.2 Hard thresholding estimation 

We propose to use a non-linear hard thresholding estimator defined by 

2^0 (")-l ji{n) 23-1 

k=0 j=Mn) k=0 

In the above formula, Sj{n) refers to possibly random thresholds that depend on the resolution 
j, while jo = join) and ji = ji{n) are the usual coarsest and highest resolution levels whose 
dependency on n will be specified later on. Then, let us introduce some notations. For all j G N, 
define 



-I 



and for any 7 > 0, let 

^ / X 1 , 47 log n 

n ^-^ 6n 

1=1 



where Ki = dN'l is the number of points of the counting process A'^* for i = 1, . . . , ri. Introduce 
also the class of bounded intensity functions 

Aoo = {A G L2([0, 1]); ||A||oo < +00 and \{t) > for ah t G [0, 1]} , 

where ||A||oo = supig[o^i]{| A(t)|}. 

Theorem 3.1 Suppose that g satisfies Assumption \2.1\ and Assumption Let 1 < p < 00, 
1 < (/ < 00 and ^ > 0. Let p' = min(2,p), and assume that s > 1/p' and (s + 1/2 — l/p')p > 
u{2 — p). Let (5 > and suppose that the non-linear estimator A^ (|3.3p is computed using the 
random thresholds 




(ll5'l|oo-^n(7) + 5) + ' f"^ ^(iin) < j < Ji(n), 

with 7 > 2, and where a'j and ej are defined in ()3.4p . Define jo{n) as the largest integer such 

that 2-'°(") < log 72 and ji{n) as the largest integer such that 2-'^(") < (^y^^^^'^^^- Then, as 
n — >• +00, 

- I, / / log n \ 2s+2-^+l \ 

sup n{\l\) = o[(^] 

Hence, Theorem 13.11 shows that under Assumption 12.11 the quadratic risk of the non-linear 
estimator Ajj is of polynomial order of the sample size n, and that this rate deteriorates as 
u increases. Again, this result illustrates the connection between estimating a mean intensity 
from the observation of Poisson processes and the analysis of inverse problems in nonparametric 
statistics. Note that the choices of the random thresholds Sj{n) and the highest resolution 
level ji do not depend on the smoothness parameter s. Hence, contrary to the linear estimator 
proposed in Section [21 the non-linear estimator A^ is said to be adaptive with respect to the 
unknown smoothness s. Moreover, the Besov spaces Bp q{A) may contain functions with local 
irregularities. The above described non-linear estimator is thus suitable for the estimation of 
non-globally smooth functions. 

In Section 21 we show that the rate n 2s+2z^+i g, lower bound for the asymptotic decay 
of the minimax risk over a large scale of Besov balls. Hence, the wavelet estimator that we 
propose is almost optimal up to a logarithmic term which is usually called the price to be paid 
for adaptivity. 
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3.3 Proof of the upper bound 

Following standard arguments in wavelet thresholding to derive the rate of convergence of such 
non-linear estimators (see e.g. [H]), one needs to bound the centered moment of order 2 and 4 of 
Cjg^k and I3j^k (see Proposition 13. ip . as well as the deviation in probability between (3j^k and Pj^k 
(see Proposition [321) • the proof, C, C, Ci, C2 denote positive constants that are independent 
of A and n, and whose value may change from line to line. The proof requires technical results 
that are postponed and proved in Section [3. 3. 2[ First, define the following quantities 



i&vij iGfij ' ' ' ten, 



and 



A^fc(7) = ^Uj,k\\l (||9||oo/^n(7)^^^ +^^n(7)) + ^^Uj,k\U (3.6) 

where Knil) is introduced in (|3.5p . Un(7) is a sequence of reals such that Un(7) = o ^ " ^ as 
n — >• +00. 

3.3.1 Proof of Theorem \3A] 

As classically done in wavelet thresholding, use the following risk decomposition 

E||A^ — A II 2 = Ri + R2 + R3 + -^4) 

where 

2J0-1 ji 2^-1 

Rl = Yl ^(^io,A: - Cj,,kf, R2 = J2Y.^ [(^A: " /3j,A:)'l||^^. ,,|>,^,(„)} 
k=0 j=jo k=0 

ji 2^-1 +00 2J-1 

j=jo k=0 j=ji+l k=0 

Bound on R4: first, recall that following our assumptions. Lemma 19.1 of [11] implies that 

23-1 

P% < C2-^^'' , with s* = s + 1/2- 1/p', (3.7) 

A;=0 

where C is a constant depending only on p,q,s,A. Since by definition 2"-'^ < 2(i^^)~2i/+i ^ 
equation (fST]) implies that = O (2-2ii^*) = O ^(l2£I!)-^^ . Note that in the case p > 2, 

then s* = s and thus 2UTT ^ 2S+2I+1 • ^^^^ 1 < P < 2, then s* = s + 1/2 — 1/p, and one 

can check that the conditions s > 1/p and s*p > u{2 — p) imply that 2v+i ^ 2s+2v+i • Hence in 
both cases one has that 

i?4 = O (n~^s+2.+i'^ . (3.8) 
Bound on Ri: using Proposition 13.11 and the inequality 2-"' < logji it follows that 

2jo(2!.+i) ('logn)2'^+i / 2s \ 

i?i < C- < C^^-^ = O { n-^^+^A . (3.9) 

n n \ J 

Bound on R2 and i?3. remark that R2 < R21 + -R22 and R^ < R^i + R32 with 

ji 2J-1 ji 2^-1 

j=jo k=0 j=jo k=0 
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ji 2^-1 



ji 2J-1 



and R,2=J2Y.^ [f^lkh 



j=jo k=0 j=jo k=0 

Now, applying twice the Cauchy-Schwarz inequality, we get that 

jl 2J-1 



{|/3,-fc|<p,(n)} 



{|/9i,fe-/3i,fcl>SiW/2} 



j=jo k=0 
jl 2^-1 



1/2 



Bound on F(|/3j^fc — (3j^k\ ^ 'Sj('T')/2): using that |q < 2"-'/^ one has that ||'i/'j,/fc||2 ^ f'"! and 
||^j,fc||oo ^ Cj- Thus, by definition of Sj{n) it follows that 

2A-,(7) < %(n)/2 (3.10) 

for all sufficiently large n where A"^(7) is defined in (j3.6p . Moreover, by p.ip there exists two 
constants Ci,C2 such that for all i £ Qj, Ci2^ < \£\ < Since \\m.\i\^_^^9ii = uniformly 

for / G Bp it follows that as j — )■ +00 



7^1 



17^1 



o [a]) and = 2-^/2 ^ f4 = ° ('^^ ' 



li>\ 



Now, define the non-random threshold 



,27logn Tlogn \ ^qii\ 

(IblloollAlli + 5) H ^^^3]^ for jo(f^) < J < JiW- (3.11) 



that for all sufficiently large n and Jo(f^) ^ J < ii('T') 



Using that V^? = o((t|) and 5j = o (ej) as j — )■ +00, and that Jo(?^) — )• +00 as n — )• +00 it follows 



'2K;27logn ^losn . 
2[\l ' + Sjl^ I < s,{n)/2 



n 



3n 



(3.12) 



Prom equation (|3.32p (see below), one has that P (^||A||i > Knj < 2n ^ , which implies that 

Sji'iT') < Sj{n) with probability larger than 1 — 2n~^ . Hence, by inequalities (|3.10p and (j3.12p . it 
follows that for all sufficiently large n 



2 max A",(7), 



'21/2^ log n 7logn 



n 



3n 



< Sj{n)/2 



(3.13) 



with probability larger than 1 — 2n . Therefore, for all sufficiently large n, Proposition 13.21 and 
inequality ()3.13p imply that 



\f3j,k - M >Sjin)/2) <Cn-\ 



(3.14) 



for ah jQ{n) < j < ji{n) 



Bound on R21 + -R31: thus, using the assumption that 7 > 2, inequality ()3.7p and Proposition 
13.11 one has that for all sufficiently large n 



R21 + R31 < c 



1 



n 



3=30 j=jo 
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By definition of ji one has that ^ < C for ah j < ji, which imphes that (since s* > 0) 

,1 



R2I + ^31 < C 



n 



9j(2!/+l) 

y - — - + y 2-'^^* 



3=30 



J=JO 



0{n 



(3.15) 



using the fact that 



2j(2^+i) 



< C for ah j < ji{n) < log^ 



n. 



2v+l "^62 

Finally, it remains to bound the term T2 = R22 + -^32- For this purpose, let j2 be the largest 
integer such that 2^'^ < 712^+2^ (log n)^ with /3 = - 2^+21/+! ' partition r2 as T2 = T21 + T22 
where the first component T21 is calculated over the resolution levels jo < j < j2 and the second 
component T22 is calculated over the resolution levels ^2 + 1 ^ J < ji (note that given our 
assumptions then j2 < ji for all sufficiently large n) . Using the definition of the threshold sj (n) 
it follows that 

log(n) (logn)^ 



Sj{ny <C[a^{\\g\\ooKn + S)- 



+ 



-e2 

-.2 J 



n 

From Assumption 12.11 on the ^^s and equation (|3.ip for Vtj it follows that 

(T? < 02^^^ and < C2^i'^+^''^\ 



(3.16) 



Since, for < ^'+^ all j < h, it follows that (i2S^e2 < c2^j.]og{n) ^ 



sjinf < C2^^''{\\g\\^Kn + 6 + 1)^^ 

n 

Using Proposition 13. 11 the bound (|3.17|) . the fact that 



]EA'n< IIAilli + O I 
and the definition of j2 one obtains that 

j=jo k=0 ^ 



logn 



n 



l/2^ 



(3.17) 



(3.18) 



O 



'2^2(2!/+!) 



n 



log(n) 



O In 2s+2^+i (logn) 2^+2^+1 ) . (3.19) 



Then, it remains to obtain a bound for T22. Recall that Sj(n) > Sj{n) with probability larger 
that 1 — 2n~'^, where Sj{n) is defined in ()3.1ip . Therefore, by using Cauchy-Schwarz inequality 
one obtains that 



E 



{^j,k - l3j,k)^^(Sj,k\>Sj{n)/2} < E(/3j- fc - /3i,fc)^a{|/3^. fc|>s,(n)/2} 



+ E(/3,-fc-/3,-fc)^ 



1/2 



{sj{n) < Sj{n))) 



1/2 



Then, by Assumption 12.11 one has that a] > 02^^". Therefore, using Proposition 13.11 it follows 

that E(/3j- fc - fc)2 < Cs2(n) and that E(/3j- ^ - /3j- fc)"^ < for all j < ji. Finally, using that 

7 > 2 and the fact that P(sj(n) < Sj{n)) < 2n~'^ , one finally obtains that for any j < ji 



E 



< c 



s2(n)_ 



22jrl. 



^—Hpj,k\>Sj{n)/2} + 



(3.20) 
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Let us first consider the case p > 2. Using inequality ()3.20p one has that 

/ ii 2^-1 g2(-^^) 22jv 
\j=j2+l k=0 

( ^1 2^-1 1 ^1 9i(2i^+l)^ 

< c[ Y. EiM= + ^ E — 

\j=j2+i fc=o i=i2+i 
Then ()3.7p . the definition of j2, ji and the fact that s* = s imply that 

T22 = O I 2-2^2^ + - ^ I = O (^n'^s+Z+i (log n) ) (3.21) 

i=i2+i 

Now, consider the case 1 < p < 2. Using again inequality ()3.20p one obtains that 

722 < C 2^ ^P^{|/3.,.l>«.{n)/2} + -^+E|/3j,fc| U{|^^,^|<|^^,(„)| 

\j=j2+l A:=0 

/ 2^-1 1 ■''i 9i(2v+l) » 

y=i2+i fc=o i=i2+i 

By Holder inequality, it follows that for any a > 1, Ksj{n)'^~'P < (Esj(n)"*^2~P^) Hence, by 

taking a = 2/(2 - p) it follows that Esj{n)'^~P < (Esj(n)2) Then, using the following 
upper bounds (as a consequence of the definition of s^(ri) and the arguments used to derive 
inequalities (KTf\\ . (IXT8|) ) 

s'Jn) < C2'^^]^ and Es,{nf < Cl'^'^EK,,^^ < 02^^'^^^, 
■' n n n 

it follows that inequality (13:22]) and the fact that for A E B^ ,j{A), EL~o^ \^j,k\^ ^ C2-J>^* (with 
ps* = + p/2 — 1) imply that 

T,. < c(y 2«^a-./.M + ! f 

I ^-^ \ n / n ^ n 

\i=i2+i ^ ^ i=i2+i 

/ /in^riX Jl 1 ■'1 oj(2i/+l) 



o 



n I '■ — ' n — ' n 

j=j2 + l j=j2 + l 

2J2M2-p)-ps*) ^ 1 fl^ 

n I n ^ n 

^ i=i2+i 



= O [n 2S+21.+1 (log n) 2^+2-^+1 j (3.23) 

where we have used the assumption v{2 — p) < ps* and the definition of j2, ji for the last in- 
equalities. Finally, combining the bounds ([SJ]), (IXT5]l . dXTO]) . (l3:2B and ([OH]) completes 
the proof of Theorem 13.11 □ 

3.3.2 Technical results 

Arguing as in the proof of Proposition 3 in [2], one has the following lemma: 
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Lemma 3.1 Suppose that g satisfies Assumption \2.1\ and Assumption \2.SX Then, there exists a 
constants C > such that for any j > and < k < 2^ — 1 

Uj,k\\oo < C2^^''+'/^\ Uj,k\\l < 02''" and Ul^Wl < C2^-(4-+i). 

Proposition 3.1 There exists C > such that for any j > and < k < 2^ — 1 

E|cj, fc - cj^kf < C— (1 + IIAII2II9II00) , E|4-fc - /3j- < C— (1 + IIAII2II5II00) , (3.24) 
and 

nPj,k - < il + -)(! + IIAIllllffllL + IIAII2II9II00 + ||A||i||5||oo) . (3.25) 

Proof : We only prove the proposition for tiie wavelet coefficients Pj^^ since the arguments are 
the same to prove the result for the scaling coefficients Cj^k- Remark first that I3j^k — /3j,k = 
X^^GCj '^ei'^j,k){(^e ~ (^e) = Zi + Z2, where Zi and Z2 are the centered variables 

Zi ■■= (707^ - l)9ece{(t>j,k)- 



and 



where dNf = dNl -\{t-Ti)dt. 



n 1 

1=1 ■'^ 



Control of the moments of Zy. by arguing as in the proof of Proposition 3 in [2], one obtains 
that there exists a universal constant C > such that 

E|Zi|2 < C and E|Zi|^ < C ^ + ^ . (3.26) 

n \ w j 

The main arguments to obtain (|3.26p rely on concentration inequalities on the variables Ti,i = 
1, . . . ,n. 

Control of the moments of Z2- using Lemma l3.ll remark that 

E|Z2|2 = C ^^^{t)\{t-Ti)dt=- C ^l^{t)\^g{t)dt, 

" JO Jo 

22ju 

< C ||A*5r||oo < C IIAII2II5II00. 

n n 

Let us now bound E|Z2|^ by using Rosenthal's inequality [21] 

2p 



E 



1=1 



< 



16p 



log(2p) 



\2p ( f n \P n -\ 



which is valid for independent, centered and real-valued random variables (l^)-j=i...,n. We apply 
this inequality to 1^ = ipj^k{'t)dNl with p = 2. Conditionnaly to Tj, using Proposition 6 in 
|19j and the Jensen's inequality, it follows that 

E[y/|T,] = j%l^{t)\{t-T,)dt + ?>(^j\lk{t)\{t-Ti)d^ , 

< ^lfc(i) (A(t - Ti) + 2,\\t - T,)) dt. 

Jo 
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Hence E Y]l=i < n i>lk{t) (A * g{t) + SA^ ^ g{t)) dt. Then, using Lemma O E Y]I=i < 
Cn2-'^^^"'""^) (||A||2 + ||A|||) lls'lloo- Using again Proposition 6 in [19] and Lemma l3.ll one obtains 
that WY^ = i^{t)\ -k g{t)dt < C2^-''^||A||2||5||oo which ends the proof of the proposition. □ 



Proposition 3.2 Assume that A G A^o and let 7 > 0. Then, there exists a constant C > such 
that for any j > 0, A; G {0 ... 2-' — 1} and all sufficiently large n 



1 / 2K; 7 log n 7 log: n \ \ 

|/3,-fc-/3,-fc|>2max| A5^,(7)),y^i-^ + 5,l^ < Cn-\ (3.27) 

where A"^(7) is defined in i&.6]) . 
Proof : 

Using the notations introduced in the proof of Proposition 13.11 write /3j ^ — /3j ^ = Zi + Z2 
and remark that for any n > 

P(|Zi + Z2I > n) < P(|Zi| > u/2) +P(|Z2| > u/2) (3.28) 

Now, arguing as in Proposition 4 in [2] and using Bernstein's inequahty, one has immediately 
that 



/2V/7logn -/loen 1 
Zi\>\l '\ +^,1^1 <2n-^. (3.29) 



Let us now control the deviation of Z2 = ^Y17=i lo '^j,ki't)d^t- First, remark that con- 
ditionnaly to the shifts Ti,...,Tn, the process J27=i-^^ ^ Poisson process with intensity 
Y17=i'^i- ~ '''«)• For the sake of convenience, we introduce some additionnal notations. For 
n > 1, j > and < /c < 2J - 1, define 



= - E /' - ^0^*' and Mjk = EM;^ = 

^ Jo Jo 



'>Pjf,{t)\^g{t)dt. 



Using an analogue of Bennett's inequality for Poisson processes (see e.g. Proposition 7 in 
19]), we get that for any s > 



P l^lZal > y^M-^ + ^||V;,>||oo|Ti,...,T„j <2exp(-s) (3.30) 

Remark that the quantity M"^ is not computable from the data as its depends on A and the 
unobserved shifts ti, . . . ,Tn- Nevertheless it is possible to compute a data-based upper bound 
for MJ^. Lideed, note that Bernstein's inequality (see e.g. Proposition 2.9 in [15]) implies that 



with Mjk = II A II 00 II II 2- Obviously, Mjk is unknown but for all sufficiently large n, one has 
that 

Mjk = ||A||oo||^j,fc||2 < log"-||^i,fc||i- 



Moreover, remark that Mj^ = llV'jfcV-^ * slli — llV'ifcllillS'lloollAlli. Hence, 



MJ > IMUIIAII, + M + ^M!^ < (3.31) 
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To obtain a data-based upper bound for Mj],, it remains to obtain an upper bound for ||A||i. 
Recall that we have denoted by Ki the number of points of the process A^*. Conditionally to Tj, 
Ki is real random variable that follows a Poisson distribution with intensity \{t — Ti)dt. Since 
A is assumed to be periodic with period 1, it follows that for any i = 1, . . . , n, \{t — Ti)dt = 
X{t)dt, and thus (-f^j)i=i,...,n are i.i.d. random variables following a Poisson distribution with 
intensity ||A||i = Jq X{t)dt. Using standard arguments to derive concentration inequalities one 
has that for any u > 



1=1 / 



where ||A||i = Jq X{t)dt. Now, define the function h{y) = — V^y — a/3 for y > and with 
a = u/n. Then, the above inequality can be written as 



>\iZK^ <2exp(-n). 



Since h restricted on [-y/a(\/30 + 3^/2) / +oo[ is invertible with h ^(y) = y y + ^ + \/\ it 
follows that for u = j log n and all sufficiently large n 



Tuhixii ^ ^ , 47 log 71 / 27 log n - 572(logn)2\ 

P(^||A||i>if. + ^^ + y^^K„ + ^^j <2n \ (3.32) 
where = \ X^^Li-^j- Therefore, using (j3.3ip it follows that 



^\^'q,:>\\h.f.\li,\\^K^(-<)^\^^ <3«-. (3.33) 

where i^n(7) is defined in p.Sp . Hence, combining ()3.30p with s = 7logn and (j3.33p we obtain 
that 



1^2! > 



27 log n II ~ 2 / II II i> I \ , \ 7(logn)2 ^ /27(logn)3^^ ^ 7logn| 



(3.34) 

Combining inequalities (|3.28p . (|3.29p and (|3.34p concludes the proof. □ 



4 Lower bound on the minimax risk 
4.1 Main result 

Theorem 4.1 Suppose that g satisfies Assu'mption \2. 1\ and Assumption \2.SX Introduce the class 
of functions 

Ao = {A G L2([0, 1]); \{t) > for all t e [0,1]} . 

Let 1 < p < 00, 1 < q < 00, A > and assume that s > 2z^ + 1. Then, there exists a constant 
Cq > (independent of n) such that for all sufficiently large n 

2s 

inf sup 7^(A„, A) > Con 2s+2,.+i ^ 
A„ AGB|,,(A)nAo 

where the above infimum is taken over the set of all possible estimators A„ G L^([0, 1]) of the 
intensity X (i.e the set of all measurable mapping of the random processes N'^, i = 1, . . . ,n taking 
their value in L'^{[0, 1]))- 
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4.2 Some properties of Meyer wavelets 

Recall that the Meyer mother wavelet ip is not compactly supported. Nevertheless, Meyer 
wavelet function satisfies the following proposition which will be useful for the construction of a 
lower bound of the minimax risk. 

Proposition 4.1 There exists a universal constant c{tp) such that for any j G N and for any 

{^k)o<k<2^-i G {0,1}'' 

2^-1 
k=0 



sup 



Proof : Note that for periodic Meyer wavelets, one has that 



sup \'4)(x — fc)| < oo. 



'fcez 



Hence the proof follows using the definition of ip^^},{x) = 2^/'^iIj{2^x - k). □ 



4.3 Definitions and notations 

Recall that ti, . . . ,t„ are i.i.d. random variables with density and that for A € Aq a given 
intensity, we denote by A^^, . . . , iV" the counting processes such that conditionally to ti, . . . , r^, 
A^^, . . . , A^" are independent Poisson processes with intensities A(- — ti), . . . , A(- — t„). Then, the 
notation Ea will be used to denote the expectation with respect to the distribution Pa (tensorized 
law) of the multivariate counting process N = {N^, . . . , A^""). In the rest of the proof, we also 
assume that p, q denote two integers such that 1 < p < oo, 1 < (7 < 00, ^ is a positive constant, 
and that s is a positive real such that s > 2z^ + 1, where v is the degree of ill-posedness defined 
in Assumption 12. II 

A key step in the proof is the use of the likelihood ratio A{Hq,Hi) between two measures 
associated to two hypotheses Hq and Hi on the intensities of the Poisson processes we consider. 
The following lemma, whose proof can be found in [3], is a Girsanov's like formula for Poisson 
processes. 

Lemma 4.1 (Girsanov's like formula) Let Mq (hypothesis Hq) and Ni (hypothesis Hi) two 
Poisson processes having respective intensity Ao(t) = p and \i{t) = p + p-(t) for all t € [0, 1], 
where p > is a positive constant and fj, € Aq is a positive function. Let P^i (resp. "^Xq) he the 
distribution of Ni (resp. Nq). Then, the likelihood ratio between Hq and Hi is 



K{Hq,Hi){N) :-- 



dl 



i(AA) = exp 



A() 



p{t)dt + log 1 



P{t) 



dMt 



(4.1) 



where M is a Poisson process with intensity belonging to Kq. 



The above lemma means that if F{N) is a real-valued and bounded measurable function of the 
counting process J\f = Mi (hypothesis i/i), then 



[F{N)] = [F{N)k{HQ,Hi){M)] 

\^ (hypothesis Hi] 



and E//y denotes the 



where denotes the expectation with respect to 
expectation with respect to (hypothesis Hq). 

Obviously, one can adapt Lemma 14.11 to the case of n independent Poisson processes M = 
{M^, . . . AA") with respective intensities Xi{t) = p + Pi{t), t € [0, 1], i = 1, . . . , n under Hi and 
Aj(t) = p, t £ [0, 1], i = 1, . . . , n under Hq, where pi, . . . , pn are positive intensities in Aq. In 
such the Girsanov's like formula ()4.ip becomes 



AiHo,Hi)iM) = llexp 



Pi{t)dt+ / log 1 + 



Pi{i) 



dMl 



(4.2) 
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4.4 Minoration of the minimax risk using the Assouad's cube technique 

Let us first describe the main idea of tlie proof. In Lemma we provide a first result giving a 
lower bound on the quadradic risk of any estimator over a specific set of test functions. These 
test functions are appropriate linear combinations of Meyer wavelets whose construction follows 
ideas of the Assouad's cube technique to derive lower bounds for minimax risks (see e.g. [9lll8]). 
A key step in the proof of Lemma 14.21 is the use of the likelihood ratio formula ()4.2p . Then, we 
detail precisely in Lemma 14.31 the asymptotic behavior of the likelihood ratio (j4.5p defined in 
Lemma 14.21 under well-chosen hypotheses Hi and Hq. The result of Theorem 14.11 then follows 
from these two lemmas. 

Given an integer D > 1, introduce 

SD{A) = {f G Aon Bl^iA) I (/,Vi,fc) = 0Vj/Z5VA:G{0...2^'-l}}. 

For any u = (wfc)fc=o,... ,2^5-1 ^ {0,1}^" and i G {0, . . . , 2^ - 1}, we define G {0,1}^^ as 
= ujj^,\/k ^ I and u)^ = 1 — uj£. In what follows, we will use the likelihood ratio formula ()4.2p 
with the intensity 

Ao(t) = p(A) = ^,VtG [0,1], (4.3) 

which corresponds to the hypothesis Hq under which all the intensities of the observed counting 
processes are constant and equal to A/2 where A is the radius of the Besov ball Bp^{A). Next, 

for any a; E {0, 1}^^^^, we denote by Xd,uj the intensity defined as 

2^-1 

Ad,^ = P(.A) +^dY1 "^kMk + Cd2^/2c(^), with = c2-'"^'+^/^\ (4.4) 

k=0 

for some constant < c < A/{2+c{ip)), and where c{ip) is the constant introduced in Proposition 
14.11 For the sake of convenience, we omit in what follows the subscript D and write instead 
of Xd,lu- First, remark that each function A^^; can be written as A^j = p{A) + /U^j where 

2^-1 

Mc^ = ?D ^ Wk^lJD,k + ^d2^/^c(^), 

k=0 

is a positive intensity belonging to Aq by Proposition 14.11 Moreover, it can be checked that 
the condition c < A/{2 + c(V')) implies that A^^; G Bp^g{A). Therefore, A^; G Sd{A) for any 
U! € {0, 1}^^. The following lemma provides a lower bound on Sd- 

Lemma 4.2 Using the notations defined above, the following inequality holds 

.2 . 2^-1 

inf sup Kx\\\n-X\\'>^-^J2 E ^xJlAQkAN)], 
withN = (iVi,...,iV") and 



kn nr=i exp 


- li /^^^ (i - ^^)dt + J^' log (l + 


)dNi 


g{ai)dai 


nr=i exp 


- /o pUt - a,)dt + f^' log (l + ^^^) 


dNi 


g{ai)dai 



Q-K^i^) = : V—. 7. 7 ,. .1 • (4.5) 



Proof : Let A„ = A„(A^) G -^^^([0, 1]) denote any estimator of A G Sd{A) (a measurable function 
of the process N). Note that, to simplify the notations, we will drop in the proof the dependency 
of \n{N) on N and n, and we write A = \n{N). Then, define 

R{\)= sup EA||A-Af. 

AG5_D(yl) 
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Since A^j G S^i^) for any u € {0, 1}^ , it follows from Parseval's relation that 



2^-1 



RiX)> sup ExJX-\J^> sup \^D,kW - ^k^D\^ 

c^6{0,iP° u;G{0,1}2-^ fc=0 

where we have used the notation /3_D.fc(A) = (A, 4'D,k)- For all k G {0, . . . , 2^ — 1} define 

Wfc = i^k{N) := arg min |/3D,fc(A(iV)) - v^dI- 

dG{0,1} 

Then, the triangular inequality and the definition of (2}^ imply that 

Cnli^k - Wfcl < \ukCD - /3z),fc(A)| + |/3D,fc(A) - iOk^ol < 2|^D,fc(A) - ujkCnl- 

Thus, 

.2 2^-1 

i2(A) > ^ sup Ex^Y\u;k{N)-Ukf, 



> 



4 22" 



2^-1 



(4.6) 

Let /c G {0, . . . , 2^ — 1} and w G {0, 1}^^ be fixed parameters. Conditionally to the vector 
T = (ti, . . . T„) G M", we define the two hypothesis Hq and as 

ffo: -^^5 • • • 1 are independent Poisson processes with intensities (Ao(- — ti), . . . , Ao(- — t„)) = 
(Ao(-)5 • • • ) Ao('))) where Aq is the constant intensity defined by ()4.3p . 

H^: N^, . . . , iV" are independent Poisson processes with intensities {Xui{- — ti), . . . , Ai^(- — t„)). 

In what follows, we use the notation (resp. ^hz) to denote the expectation under the 
hypothesis Hq (resp. H^) conditionally to r = (Ti,...Tn). The Girsanov's like formula ()4.2p 
yields 



with dr = dri , . . . , (iT„ and 



A{Ho,H:,){N) = lle^p 



1=1 



EHi\u)k{N) - ujf,\'^g{Ti) . ..g{Tn)dT 

Eh, [\Cok{N) - cok\^A{Ho,H:,){N)] g{n) . . . g{Tn)dT, 



fl^{t-Ti)dt+ / log 1 + 



for N = (N^ , . . . , N"^). Now, remark that under the hypothesis Hq, the law of the random 
variable uJk{N) does not depend on the random shifts r = (ri, . . . ,t„) since Aq is a constant 
intensity. Thus, we obtain the following equality 



Uk{N) - / A{Hq, Hl){N)g{n) . . . g{Tn)dT 



EAjwfc(iV)-Wfc|2=Ej^„ 



Using equality (j4.7p . we may re- write the lower bound (j4.6p on -R(A) as 



(4.7) 



R{\) > 



4 22 
4 22 



2"-l 

2^-1 



iit(Af)-unP / A(ff„,fl;)(W)9(Ti)...9(T„)dT 

Wfc(iV) - a;,.|2 / A{Ho, Hl){N)g{n) . . . g{Tn)dT 



E 



^==0 wG{0,l}2"|«)fe=l 

\uk{N)-ot\^ [ A{HQ,Hl,){N)g{Ti)...g{Tn)dT 



+ 
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Using the inequality |1 — v\'^z + \v\'^z' > z Az' that holds for all v € {0, 1} and all reals z, z' > 0, 
we deduce that 

R{X) > ^ ^ / ^iHo,H-)iN)gin)...g{Tn)dTA, 



> 



AiHo,HZ^,)iN)g{n)...g{Tn)dT 



C2 1 2^-1 



V V / A(^0, F:)(iV)9(Tl) . . . g{Tn)dT (lA 



4 22" 

Ji^„ A{Ho,H2,){N)g{ai) . . . g{an)da 
A{Ho, HS){N)g{ai) . . . g{an)da 

^ 4 ^ E E / ^H,[KHo:H:,){N){lAQk,^{N))]g{n)...g{Tn)dT, 

p ^ /i^n A(go, gg,)(iV)ff(«i) . . . g{an)da 

^''"^ ^ J^^A{Ho,HS){N)g{ai)...g{an)da' 



where 



and da = dai . . . dan- Then, using again the Girsanov's like formula (|4.2p . we obtain the lower 
bound 

RW>-f^Y. E EaJiaq,,.], 

that is independent of A which ends the proof of the lemma. □ 

We detail in the next paragraph how to use Lemma 14.21 with a suitable value for the parameter 
D to obtain the desired lower bound on the minimax risk. 

4.5 Quantitative settings 

In the rest of the proof, we will suppose that D = Dn satisfies the asymptotic equivalence 

n 1 

2^" ~ 77,2^+2-^+1 as n +00. (4.8) 

To simplify the notations we will drop the subscript n, and we write D = Dn- For two sequences 
of reals (an)n>i and {bn)n>i we use the notation a„ x 6„ if there exists two positive constants 
C,C' > such that C < < C" for all sufficiently large n. Then, define mD„ = 2^"/2^d„. 
Since Cd„ = c2--^"(^+V2) ^ it" follows that 

as 7^ — >• oo. Remark also that the condition s > 2z^ + 1 implies that 

nml^ X ^-(.-2.-i)/{2s+2.+i) ^ Q 

as n — )• oo. 
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4.6 Lower bound of the "likelihood ratio" Q 



k,uj 



The above quantitative settings combined with Lemma [4.2l win allow us to obtain a lower bound 
of the minimax risk. For this purpose, let < 5 < 1, and remark that Lemma 14.21 implies that 



inf sup EaIIA - Af > ^-i, Yl E ^>^- (.QUN) > S) . (4.9) 



2^-1 



A„ Ag5d(A) 4 22 



'^=0 ajG{0,l}2'°|«)fe=l 



The remainder of the proof is thus devoted to the construction of a lower bound in probability 
for the random variable Qk,u]{N) := ^ where 



h=h{N) 



r _ _ r /"i /*i 

/ ri exp - / {t - ai)dt + log (1 + ^u^fe (t - ai)) dNl 
Jr" I Jo Jo 



g{ai)dai 



and 



h = h{N) 



JJexp 

i=l 



li^{t-ai)dt+ I log {1 + fi^{t - ai)) dNl 



g{ai)dai 



where to simplify the presentation of the proof we have taken p{A) = 1 i.e. A = 2. Then, the 
following lemma holds (which is also valid for p{A) 7^ 1). 



Lemma 4.3 There exists < 5 < 1 and a constant po{6) > such that for any k £ {0 ... 2 
1}, any uj G {0, l}^^" and all sufficiently large n 



Dn 



Pa. {Qk,JN) >6)> po{5) > 0. 



1/2 



Proof : For a function A G L'^{[0,1]), we denote by ||A|| = yf^ \X{t)\'^dtj its L2 norm and 
by ||A||oo = sup(g[o,i] {l'^(*)l} its supremum norm. In the proof, we repeatedly use the following 
inequalities that hold for any oj G {0, l}^^" 



I pUJ 1 1 — II PllJ 1 1 00 

I ^ui 1 1 — II Atj 1 1 00 

< p{A) + 2c(^)mD„ ^ p{A) = 1/2, 



(4.10) 



as n — )• +00. Since for any k, one has Jq ipD,kit)dt = 0, it follows that for any u and a, 
Jo Pu,{t - a)dt = c(V')Cd„ 2^-/2 = c(^)mz)„. Therefore, 



and 



h = I g{ai) . . . <7(aJe-^('^)"™^« J] 

1=1 



5(ai)...5(a„)e-'=W"'"^"n«^P 

i=l 



log {I + p^,{t-ai))dNi 



log {1 + pUt-ai))dNi 



da, 



da. 



Let z > be a positive real, and consider the following second order expansion of the logarithm 

log(l + z) = z 1 for some 1 < n < 1 + z. (4-11) 

2 3 

Applying (|4.11|) implies that 

u?,.(i - ai) 



f 

Jo 



log{l + p^k{t-ai)) dNl < / </i<^fc(t-ai) 



dNl+ [\l,{t-a,)dNi, 
Jo 

(4.12) 
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and that ^ 

/ log {1 + fi^{t-ai))dN'i> 
Jo Jo 

Then, remark that inequahties ()4.10p imply that 



dNl 



(4.13) 



Ea. / 4,,{t-ai)dNl 



t^ik{t-ai) / \^{t - Ti)g{Ti)dTidt 



< ll^a;fe||oo||/^<:Dfef IIA^Iloo = O {m%J 

Therefore, by Markov's inequahty it follows that fl^^.(t — ai)dNl = Op {in\)^ as n — )■ +00. 
Hence, using inequality (j4.12p . one obtains that 



h < e 



-c{ip)nTnun +Op {nm- 



» n 

5n) / g{ai) . . .5(a„)TTexp 



fJ-uiit - ai) 



dNl 



da. 



and by inequality ()4.13p it follows that 

» n 

h > e-=W"™^" / g{a^)...g{an)llexp 



1=1 



lj,^k{t - ai) 



^l At - ai) 



dm 



da, 



Combining the above inequalities and the Fubini's relation we obtain that 



QkA^) > e 



nr=i/R5(ai)exp 




|/i^fc(t - ai) 




dai 


n"=i/R9(ai)exp 




- ai) 




dai 



Let z G M and consider the following second order expansion of the exponential 

exp(z) = 1 + z -\ 1 exp(u) for some —\z\ <u< \z\. 

2 6 



(4.14) 



(4.15) 



Let us now use (j4.15p with z = {fJ'Cjk{t — ai) — ^fl'i^.{t — ai)} dNl. By inequalities ()4.10p . one 
has that 



fi^k{t - ai) + ^f4;kit -ai)] I X^{t - Ti)g{Ti)dTidt, 
1, 



Therefore, \z\ = Op {nif)^) by Markov's inequality. Since m£)^ —^0, we obtain by using (|4.15p 
that for each i G {1, . . . , n}, 



exp 



fi^k{t - tti) - ^/i?fc(t - ai) } dNl 



1+1 f^cokit - ai)dNl - ^ I f,l,{t - a,)dNl 



1 

+ 2 



fi^k{t-ai)dNl-^ I fil,it-ai)dNi) +Op{mlj, 



1+ ii^k{t - a^)dNl - - n^_,(t-ai)dNl + -[ fi^k{t - ai)dNl \ +Op{m%J, 



21 



where we have used the fact that 



^i^,{t - oii)dNi - - iil,{t - ai)dNl 

. From the definition of Ji in (|4.14|) . we can use a stochastic version of the Fubini theorem (see 
|10j . Theorem 5.44) to obtain 



1 + 



g{ai)fi^k{t - ai)dN'ldai - - / g{ai)fj,'^k{t - ai)dNldai 



n 

i=l 



n 

1=1 



1+/ 9*IJ^a,^{t)dNl-^ I g*fil,it)dNl 



At this step, it will be more convenient to work with the logarithm of the term Ji. We have 



n p „i 

In(Ji) = Vln 1+ g 
*=i L Jo 



* f,^,{t)dNi - - / g*^il,{t)dN^ 



+^ / 9{ai){ I fiok{t - ai)dNl ] doi + Op{m%J 



Using again the second order expansion of the logarithm (|4.1ip . we obtain that 



In(Ji) = E / 



g * fM^, {t)dNi -\ I g^fiL it)dNl 



+ 7; I ai^i) { I f^o^t - Oii)dNl ] da 



Using similar arguments for the term J2 defined in ()4.14p . we obtain that 



n r „i 

ln(J2) = E / 



g-kn^{t)dNi 



g*^il{t)dN, 



+ / 9{ai)\ / f^Ut-ai)dNi] doi - - ( / g*fi^{t)dNi] + Op(m^J 
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Combing the above equalities for Ji and J2, we obtain the fohowing lower bound for ln(Qfc^^(A^)) 
HQkAN)) > HJ,)-ln{J2) + Op{nmlj 

= Ei^^- (/ +^lb*A,f -i||5*A^fef (4.16) 



2° ^ ,1 



1 



(4.17) 
(4.18) 
(4.19) 



1, 



^ g-kn^k{t)dNlj +-\\g*Xa;4 + 
3 



^ (^l\*fiUt)dNi^ - i||g*A^f }(4.20) 



-Op(nm^J 



In what follows, we will show that, for all sufficiently large n, the terms (|4.16p - (|4.20|) are bounded 
from below (in probability). Since nm^^ — )• 0, this will imply that there exists c > (not 
depending on A^^;) and a constant p{c) > such that for all sufficiently large n 

Pa. (ln(Qfc,^(iV)) > -c) = Pa. (QkA^) > exp(-c)) > p{c) > 

which is the result stated in Lemma [ 



Lower bound for ()4.16p : since for any 1 < i < n 



g-k{X^k{t) - Xu,{t)}{g -k X^{t)}dt. 



We obtain that 



E 

i=l 



Ea. ( / g*{Xo'^{t)- XUt)}dNi + -\\g*XJ^ --\\gkX^,f 



n 



Remark that /x^^ — = zizS^o'ipDk- In what follows we will repeatidely use the following relation 



g\\'= ^\^l>Dk^g{t)fdt= V |Q(VDfc)|'|7d'^2-2^- (4.21) 



which follows from Assumption 12 . 21 combined with Parseval's relation, from the fact that ij^^D 
2^ and that under Assumption 12.11 17^1 >i 2~^'^ for all i Qd. Therefore 



=eD / {i^Dk^giwdt^en'^ 

Jo 



Therefore, 



E 

i=l 



Ea. ( / 9*{Xck{t)-XUt)}dNi + -\\g*X^f--\\gkX^kf 



1, 



which implies that there exists a constant < cq < +00 such that for all sufficiently large n the 
deterministic term ()4.16p satisfies 



/ /■ 1 1 

= E [J^ 9* i^-^ (*) - ^Ut)}dNl + -\\g* A^f - -||g * A, 



> -Co- 
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In the rest of the proof, we show that, for all sufficiently large n, the terms (|4.17p - (j4.20p are 
bounded from below in probability. Without loss of generality, we consider only in what follows 
the case - l-i'Cui' = ioi'Dk- 

Lower bound for ()4.17p : rewrite first ()4.17p as 



gi7D = -eDV/ 5*^D,fc(tw, 
i=i 

where dNl = dNl — \{t — T.j)dt. Then, using the fact that, conditonnaly to n, . . . the 
counting process Yll=i a Poisson process with intensity YL^=i -^<^(* ~ follows from an 

analogue of Bennett's inequality for Poisson processes (see e.g. Proposition 7 in [19]) that for 
any y > 



n „i 

^=l 



< 



+^yCD|b*V'D,fc||oo|Ti,...,r„^ > 1 -exp(-2/) 



Since YIi=i \9 '^D,k{t)?^Lj{t - Ti)dt < n||5r * V'D,fc|Pl|At^l|oo for any n, . . . ,t„, it follows that 
for y = log(2) 



n „i 



< J2\ogi2)eDn\\g^iJD,k{tWMoo +-log(2)CD|b*V'D,fc||oo > 1/2. 



Now, using that Ci)"'ll5'*^Z),fc(i)lPl|Acj||oo ^ 1 and ^D||5'*^Z),fc||oo < ||V'I|oo2^/^Cd 0, it follows 



that there exists a constant ci > such that for all sufficiently large n 

n „i 



'(|(|lIZI)|<ci) 



" pi 

/ g*i;D,kit)dNi 
i=i -^0 



< ci > 1/2. 



(4.22) 



Lower bound for ()4.18p and ()4.19p : define 

"1 



g^^,l{t)dNl- \ I g*f,l,{t)dN^ 



+^ / 9{<^i) { I licjk{t - ai)dNl ] da, 



g{cii)[ I fJ.uj{t - ai)dNn dui 



and note that ()TOD + (|TOD = X;r=i ^i- any 1 < i < 



n 



gyoLi 



H^k {t - ai)g -k \^{t)dt 



fii^{t - ai)g -k Xu,{t)dt] ] dai 



(/i^(t - ai) - n^k{t - Oi)) g -k Xi^{t)dt ) I da. 



£,DipD,kit - ai)g-k fi^{t)dt] { / (;U^(t - aj) - /i^fe(t - aj))5*;U^(t)dt dai 



which implies that 
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Therefore ^^=i^x^Xi — >• as n — ^ +00, since nm'fj — t- 0. Now, remark that Xi, . . . ,X„ are 
i.i.d variables satisfying for ah 1 < i < n 

where Ki = dN^. Conditionally to Tj, Ki is a Poisson variable with intensity X^{t — Ti)dt = 
Jq \u{t)dt = ||Ai^||i. Hence, the bound ()4.10p for ||A;^||oo and inequality (|4.23p implies that there 
exists a constant C > (not depending on A^) such that 

which implies that Var(^"^-|^ Xi) = nVar(Xi) < nEXf — t- as n — ?■ +00 since nmj^ — ?• 0. There- 
fore, (|4.18|) + (|4.19|) = J27=i -^i converges to zero in probability as n — )• +00 using Chebyshev's 
inequality. 

Lower bound for (|4.20p : we denote by Si the difference 



Si := 2 [^-^ (^J^ g^fi^k{t)dNi^ + ^Ib * A^^ f + ^ g * ^i^{t)dNi^ - ^\\g * XJ\^ 
and remark that ()4.20p = ^ XlILi ^i- Fi^'st, remark that 



OJ 1 1 I ) 



^Xu^Si = \\g-k\^k\\ -\\g-k\uj\\ + / {g*f^io) {t)g -k \^{t)dt - I {g-kfj^o'') {t)g -k \^{t)dt 

Jo Jo 

+ {9*^^oJ){t)Xul{t-Ti)dt^ ~{/ ^3-*'l^wk){t)Xojit-Ti)dt^^dTi. 

Since \\g * fi^k |p — \\g * fj-uiW^ = \\g * A;;^*; |p — -k A^jp and g -k = 1 + g -k fi^^ it follows that 



^Xc^Si = / {g-k fJ'cu)'^{t)g-k Huj{t)dt - {g-k ix^kf{t)g-k ii^{t)dt 
Jo Jo 



{g -k ficu){t)X^{t - Ti)dt} {g* fJ'o'<:)it)^uj{t - Ti)dt^ jdTi. 



One has that 

|5'i,i| < Wnujfoo + ll/^LDfc|lLll/^^lloo < 16c^(V')m|,^, 

and that 



g*ipD,k{t)X^{t-Ti)dt\ [ I gkr{fi^ + fi^k){t)X^{t-Ti)dt] ] dn 



Hence using ()4.10p and ()4.2ip it follows that there exists a constant C > such that for all 
sufficiently large n 

\Si,2\ < ^hWg^i^DMl (IIm^IIoo + \\t^u4oo) < Cn~^^^^ 

Then, since s > 2z/ + 1 > it follows that 



i=i ^ ^ 
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Now, note that Var(^"^^ Si) = nVar(Yi) where 

Since |Yi| < + ll^ii,* -^i with Ki = dN^ being, conditionally to ti, a Poisson 

variable with intensity X^{t — Ti)dt = X^{t)dt = \\Xui\\i. Therefore, using (|4.10p . it follows 
that there exists a constant C > (not depending on A^^) such that 

n 

Var(^ Si) = nVar(yi) < nEY^ < Cnm% 0. 

i=l 

Therefore, using Chebyshev's inequality, we obtain that ()4.20p = ^ Si converges to zero in 
probability as n — ?■ +00, which ends the proof of the lemma. □ 

4.7 Lower bound on B^'''{A) 

By applying inequality (|4.9p and Lemma |4.3| we obtain that there exists < 5 < 1 such that 
for all sufficiently large n 

inf sup Ex\\Xn-X\\l>CeDj''\ 

for some constant C > that is independent of D^ - From the definition ()4.4p of and using 
the choice (|4.8p for D„, we obtain that 

inf sup ExfXn- X\\l>CChj^" ^n'^^vt+i. 
An XeSoiA) 

Now, since Sd{A) C Bp^q{A) for any L> > 1 we obtain from the above inequalities that there 
exists a constant Cq > such that for all sufficiently large n 

inf sup n2s+2t+iE;v||A„ - AII2 > inf sup Ea||A-A|||, 
A„ Aei?»,,(A)nAo A„ Ae5B„(A) 

2s 

> Con 2s+2^+i^ 

which concludes the proof of Theorem 14.11 □ 
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